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Figure 1: BLAST result against NCBI non-redundant database using SEQ ID NO: 6 (the 
INSP 108 polypeptide). 



BLASTP 2.2.2 [ Jan-08-2002] 



Reference: Altschul, Stephen F . , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= INSPl08.pp 

(77 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,267,376 sequences; 405,046,914 total letters 

Searching done 



Score 



Sequences producing significant alignments : 



ref |NP_697019. 
ref |XP_141520. 
gb I AAM93917.il 
emb|CAB72350.2 
ref jNP__473453 
ref |NP_660139. 
sp|Q95LlO|D118 
ref )NP_631968 . 
ref 1NP_689464. 
gb|AAN33114.1| 



Ij defensin, beta 123; defensin, beta 23 [Homo sap. 
1| similar to defensin, beta 123; defensin, beta 2. 
defensin beta 124 {Homo sapiens] 

| dJ1018D12.3 (a putative novel protein) [Homo sap. 
1| epididymus specific clone 42; chromosome 20 ope. 
1| defensin beta 119; testis-specif ic beta-def ensi . 
_MACMU Beta-def ensin 118 precursor (Epididymal sec. 
11 defensin beta 15 [Mus musculus] >gi | 19171622 | em. 
1| defensin, beta 106; defensin, beta 6 [Homo sapi. 
beta-def ensin 106 [Homo sapiens] 
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Figure 2: Alignment between INSP108 polypeptide sequence (SEQ ID NO:6) and 
defensinbeta 123 {Homo sapiens). 



>ref |NP_697019.1| defensin, beta 123; defensin, beta 23 [Homo sapiens] 
splQ8N6881D123_HUMAN Beta-def ensin 123 precursor { Beta-def ensin 23) (DEFB-23) 
gb) AAM93916.1 | defensin beta 123 [Homo sapiens] 
Length = 67 

Score = 50.8 bits (120), Expect = 4e-06 

Identities - 26/67 (38%), Positives = 33/67 (48%), Gaps - 10/67 (14%) 

Query: 6 VLLALWLNLLFYVPPGRSGPNVYIQKIFASCWRLQGTCRPKCLKNEQYRILCDTIHLCC 65 

+LL L VL LL + PG + CW L G CR +C K E+ + C +CC 

Sbjct: 4 LLLTLTVLLLLSQLTPGGT QRCWNLYGKCRYRCSKKERVYVYCINNKMCC 53 



Query: 66 VNPKYLP 72 

V PKY P 
Sbjct: 5 4 VKPKYQP 60 
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Figure 3: Sig P cleavage site prediction for INSP108. 



>INSP108.pp 



SjgnalP-NN prediction (euk networks): INSP108.pp 
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>INSP108.pp length = 70 

# Measure Position Value Cutoff signal peptide? 
max. C 25 0.933 0.33 YES 

max. Y 25 0.828 0.32 YES 

max. S 9 0.991 0.82 YES 

mean S 1-24 0.864 0.47 YES 

# Most likely cleavage site between pos. 24 and 25: GRS-GP 



SignalP-HMM prediction (euk models): INSPlOS.pp 
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rwsglon prob. 
h-region prob. 

c-ff*glon prob. 
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>INSP108.pp 

Prediction: Signal peptide 
Signal peptide probability: 1.000 
Signal anchor probability: 0.000 

Max cleavage site probability: 0.906 between pos. 24 and 25 
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Figure 4: BLAST result against NCBI non-redundant database using SEQ ID NO: 14 (the 
INSP109 polypeptide). 



BLAST P 2.2.2 [ Jan-08-2002] 



Reference: Altschul, Stephen F . , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

Query= INSPl09.pp 

(78 letters) 

Database: All non-redundant GenBank CDS 
translations+POB+SwissProt+PIR+PRF 

1,267,376 sequences; 405,046,914 total letters 

Searching done 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



dbj | BAC37510. 1 ) unnamed protein product [Mus musculus] 

ref |NP_062702.1 | defensin beta 4; beta defensin-4 [Mus musculus]... 

emb 1CAA08 905.1I beta defensin-2 [Capra hircus] 

ref j NP_348802 . 1 1 Uncharacterized protein, homolog HI1244 from Ha... 
gb j AAG10514 . 1 1 AF288371_1 beta-def ensin 4 variant [Mus musculus] 
emb ICAD23115.1 1 blue cone opsin [Cottus kesslerii] 

ref 1NP_689464 . 1| defensin, beta 106; defensin, beta 6 [Homo sapi... 
gb | AAM33114.il beta-def ensin 106 [Homo sapiens] 

ref 1XP_163302.1| hypothetical protein XP_1 63302 [Mus musculus] 
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Figure 5: Alignment between INSP109 polypeptide sequence (SEQ ID NO;14) and 
defensin beta 4 (Mus mus cuius). 

>ref(NP_0 62702. 1 | defensin beta 4; beta defensin-4 [Mus musculus] 
sp|P82019|BD04_MOUSE Beta-def ensin 4 precursor (BD-4) (mBD-4) 
gb|AAD38852.1|AFl55882__l beta defensin-4 [Mus musculus] 
gb|AAG02197.1|AF287475_l beta-def ensin 4 precursor [Mus musculus] 
dbj | BAB26051 . 1 1 unnamed protein product [Mus musculus] 
db j | BAB26207 . 1 j unnamed protein product [Mus musculus] 
Length = 63 

Score =32.7 bits (73) , Expect = 1.1 

Identities = 19/56 (33%), Positives = 27/56 (47%), Gaps = 1/56 (1%) 

Query: 8 LLFFLVILLPSGKGMFGNDGVKVRTCTSQKAVCFFGCPPGYRWIAFC-HNILSCCK 62 

LLF +++L S F TC + A+C+ CP +R I C H + CCK 

Sbjct; 6 IiLFTFLLVLLSPLAAFTQIINNPITCMTNGAICWGPCPTAFRQIGNCGHFKVRCCK 61 
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Figure 6: SIg P cleavage site prediction for INSP109. 



>INSP109.pp 
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SignalP-NN prediction (euk networks): INSP1G9.pp 
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length = 70 
Value Cutoff signal peptide? 
0.84 9 0.33 YES 
0.854 0.32 YES 
0.994 0.82 YES 
0.953 0.47 YES 



# Most likely cleavage site between pos . 21 and 22: GKG-MF 



SignalP-HMM prediction (euk models): !NSP109.pp 
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Prediction: Signal peptide 
Signal peptide probability: 0.999 
Signal anchor probability: 0.001 

Max cleavage site probability: 0.619 between pos. 21 and 22 
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Figure 7: Predicted nucleotide sequence of INSP108 with translation 



1 ttgttccaaa aggttcacta gccatgcagc tccccgtctc ttcaaagctg cggagagagt 

61 gactctccga tgagtcacag ctgettcttt gctgattggt atggccacaa ggagcgtcct 

mat r s v 

= ) » 

121 cttggccctc gtggtcctta acttactctt ctatgttcca ccaggtagaa gtggacccaa 
1 1 a 1 v v 1 nil fyvp pgr s g p 

181 tgtctacata caaaaaatct ttgcttcatg ttggcgactg caaggtactt gccggccaaa 

n v y i q k i fas c w r 1 q g t c r p 

241 atgtctaaaa aacgaacaat atcgtatttt gtgtgatact atacatttgt gctgtgtaaa 

k c 1 k n e q y r i 1 c d t i h 1 c c v 

301 cccaaaatat ttacctatac tgactgggaa atagttgtga gtacctgaaa gctgttgctg 
npky lpi Itg k 

361 atttcctctg ggaacccaga tccctctcag ttgcaccatt cgattaaaac aatggcttta 
421 gcctatcagt gttc 



Position and sense of PCR primers 
Underlined : signal sequence 
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Figure 8: Nucleotide sequence with translation of INSP108 PCR product cloned using 
primers INSP108-CP1 and INSP108-CP2. 



1 attggtatgg ccacaaggag cgtcctcttg gccctcgtgg tccttaactt actcttctat 
m a t r s v 1 1 a 1 v v 1 n 1 1 f y 
_ ► 

61 gttccaccag gtagaagtgg acccaatgtc tacatacaaa aaatctttgc ttcatgttgg 
v p p g r s g p n v y 1 q k i f a s c *? 

121 cgactgcaag gtacttgccg gccaaaatgt ctaaaaaacg aacaatatcg tattttgtgt 
r 1 q g t c r p k c 1 k n e q y rile 

181 gatactatac atttgtgctg tgtaaaccca aaatatttac ctatactgac tgggaaatag 
d t i h 1 c c v n p k y 1 p i 1 t g k 

^ INSPmR_Rl 

241 ttgtg 



Position and sense of PCR primers 
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Figure 9: Map of pCR4-TOPO-INSP108 
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Figure 11: Map of expression vector pEAK12d 

Molecule: pEAK12 d, 87 60 bps DNA Circular 

File Name: pEAK12DEST . cm 5 

Description: Mammalian cell expression vector (plasmid ID 1134 5) 
Molecule Features: 
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Figure 12: Map of expression vector pDEST12.2 
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Figure 13: Map of pDONR221-INSP108-6HIS 



Molecule: pDONR22l-INSP108-6HIS , 2805 bps DNA Circular 
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Figure 14: Map of pEAK12d-INSP108~6HIS 
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Figure 15: Map of pDEST12.2-INSP108-6HIS 
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Figure 16: Predicted nucleotide sequence of INSP109 with translation 

1 atgaacctct gtctttctgc attactcttc ttcctggtga tcttactgcc ttcaggaaaa 
m n 1 els a 1 1 f f I v ill p s q k 

61 ggtatgtttg ggaatgatgg agtcaaagtt cgcacctgca etagecagaa ageegtatgt 
2 m f g n d g v k v r t c t s q k a v c 

121 ttcttcgggt gtccgccagg atacaggtgg attgegttet gecacaatat tctgtcttgc 
f f g c p p g y r w i a f c h n i 1 s c 

181 tgtaaaaata tgacacgttt tcaacccccg caagecaaag atccatgggt teat 
c k n in t r f q p p q a k d p w v h 



Underlined= signal peptide 
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Figure 17: INSP109 coding exon organization in genomic DNA and positi 
primers 
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Figure 18: Nucleotide sequence and translation of cloned INSP109 ORF 



1 atgaacctct gtctttctgc attactcttc ttcctggtga tcttactgcc ttcaggaaaa 

m n 1 els a 1 I f flv ill p s g k 

61 ggtatgtttg ggaatgatgg agtcaaagtt cgcacctgca etagecagaa ageegtatgt 

g m f g n d g v k v r t c t s q k a v c 

121 ttcttcgggt gtccgccagg atacaggtgg attgegttet gecacaatat tctgtcttgc 

f f g c p p g y r w i a f c h n i 1 s c 

181 tgtaaaaata tgacacgttt tcaacccccg caagecaaag atccatgggt tcatta 

c k n xa t r f q p p q a k d p w v h 
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Figure 19: Map of pCR4-TOPO-INSP109 

25 Nov 2003 Molecule Features 

Molecule: pCR4-TOPO INSP109, 4193 bps DNA Circular 

File Name: 13984 [1] . cm5 

Description: Ligation of inverted INSP109 assembled insert into pCR4- 
TOPO linear vector* 
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Figure 20: Map of pDONR-INSP109-6HIS 
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Figure 21: Map of pEAK12d-INSP109-6HIS 
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Figure 22: Map of pDEST12.2-INSP109-6HIS 



Molecule: pDEST12 . 2-INSP109-6HIS, 5887 bps DNA Circular 
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